[pull] master from DataDog:master#632

Merged: pull[bot] merged 5 commits into ConnectionMaster:master from DataDog:master on Jul 1, 2026
pull[bot] commented Jul 1, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

dkirov-dd and others added 5 commits July 1, 2026 14:25
* Update TagManager to handle internal and db tags correctly

* Migrate Postgres over to TagManager

* Remove logic that exists in DatabaseCheck now

* Add changelog

* Fix can_connect service check test
Adds 5 new GPU NCCL collective metrics emitted by the Datadog Agent
NCCL check (pkg/collector/corechecks/gpu/nccl) under the gpu.nccl.*
namespace (migrated from the legacy nccl.collective.* prefix):

  gpu.nccl.collective.algo_bandwidth_gbps  - GB/s algorithmic bandwidth per rank
  gpu.nccl.collective.bus_bandwidth_gbps   - GB/s bus bandwidth per rank
  gpu.nccl.collective.exec_time_us         - µs execution time per rank
  gpu.nccl.collective.msg_size_bytes       - bytes message size per rank
  gpu.nccl.rank.seconds_since_last_event   - seconds since last event (hang detection)

Inserted alphabetically between gpu.memory.temperature and gpu.nvlink.count.active.
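
For readers unfamiliar with these units, the sketch below shows how the listed metrics commonly relate to one another. It follows the bandwidth convention documented for the nccl-tests benchmarks (algorithmic bandwidth is message size over execution time; all-reduce bus bandwidth scales that by 2(n-1)/n). Whether the Agent's NCCL check computes its values exactly this way is an assumption, and the function names and the 300-second hang threshold are hypothetical, chosen only for illustration.

```python
def algo_bandwidth_gbps(msg_size_bytes: float, exec_time_us: float) -> float:
    """Algorithmic bandwidth in GB/s from a message size and an execution time.

    bytes per microsecond equals MB/s, so dividing by 1e3 gives GB/s.
    """
    if exec_time_us <= 0:
        raise ValueError("exec_time_us must be positive")
    return msg_size_bytes / exec_time_us / 1e3


def allreduce_bus_bandwidth_gbps(algo_gbps: float, n_ranks: int) -> float:
    """Bus bandwidth for an all-reduce, using the nccl-tests factor 2(n-1)/n.

    Other collectives use different factors; this covers all-reduce only.
    """
    if n_ranks < 1:
        raise ValueError("n_ranks must be at least 1")
    return algo_gbps * 2 * (n_ranks - 1) / n_ranks


def looks_hung(seconds_since_last_event: float, threshold_s: float = 300.0) -> bool:
    """Flag a rank as possibly hung when it has been silent past a threshold.

    The 300 s default is a hypothetical value, not one taken from the check.
    """
    return seconds_since_last_event > threshold_s


if __name__ == "__main__":
    algo = algo_bandwidth_gbps(msg_size_bytes=1_000_000_000, exec_time_us=1_000_000)
    print(algo, allreduce_bus_bandwidth_gbps(algo, n_ranks=8), looks_hung(12.0))
```

In practice a monitor on gpu.nccl.rank.seconds_since_last_event would play the role of `looks_hung`, with the threshold tuned to the longest expected gap between collectives.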
* Add handling for explicit Postgres image tag for testing

* Remove 19
pull[bot] locked and limited conversation to collaborators Jul 1, 2026
pull[bot] added the ⤵️ pull label Jul 1, 2026
pull[bot] merged commit 0a4bb24 into ConnectionMaster:master Jul 1, 2026
pull[bot] had a problem deploying to typo-squatting-release July 2, 2026 06:40 (Failure)

5 participants